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Question 1 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) compare features of two 
distributions of data displayed in boxplots and (2) identify statistical measures that are important in 
making decisions based on data sets. 


Solution 
Part (a): 
The median salary is approximately the same for both corporations. The range and interquartile range 
of the salaries are greater for Corporation A than for Corporation B. ‘The two highest salaries at 
Corporation A are outliers while Corporation B has no outliers. 
Part (b): 
(i) Five years after starting, at least 3 out of 30 (10%) of the salaries at Corporation A are greater than 
the maximum salary at Corporation B. If I accept the offer from Corporation A, I might be able to 
make a higher salary at Corporation A than at Corporation B. 
(ii) Five years after starting, the minimum salary at Corporation B is greater than at Corporation A. In 
fact, at Corporation A it looks like some people are still making the starting salary of $36,000 and 


never received a raise in the five years since they were hired. So if 1 work at Corporation A, I might 
never receive a raise in salary. 


Scoring 
Parts (a) and (b) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 
Essentially correct (E) if the response includes the following four components: 
1. A correct comparison of center. 
2. Acorrect comparison of spread. 
3. Adiscussion of the outliers for Corporation A. 
4. ‘The response is in context. 
Partially correct (P) if the response includes only three of the four components. 


Incorrect (I) if the response includes at most two of the four components. 


Note: Any mention of shape should be ignored because complete shape information cannot be 
determined from a boxplot. 
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Question 1 (continued) 
Part (b) is scored as follows: 
Essentially correct (E) if the response includes the following four components: 
1. In part (b-i) a relevant statistical measure is identified (or described) or a relevant statistical 
comparison is provided that supports the choice of Corporation A. 
2. In part (b-i) an explanation is provided for why the measure or comparison is relevant. 
3. In part (b-ii) a relevant statistical measure is identified (or described) or a relevant statistical 
comparison is provided that supports the choice of Corporation B. 
4. In part (b-ii) an explanation is provided for why the measure or comparison is relevant. 
Partially correct (P) if the response includes only two or three of the four components. 
Incorrect (I) if the response includes none or one of the four components. 
Note: If a response does not provide a statistical measure or comparison in part (b-i) or (b-ii), the 
second and fourth components can still be satisfied if an acceptable explanation is provided that 
would follow from a relevant statistical measure or comparison. For example, if the response in 
part (b-i) only states “At Corporation A, I have the potential to earn a higher salary,” the second 
component is satisfied. 
4 Complete Response 
Both parts essentially correct 
3 Substantial Response 
One part essentially correct and one part partially correct 
2 Developing Response 
One part essentially correct and one part incorrect 
OR 
Both parts partially correct 


1 Minimal Response 


One part partially correct and one part incorrect 
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Question 2 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) use confidence intervals to test a 
question about a proportion and (2) understand the relationship between sample size and margin of error 
in a confidence interval for a proportion. 


Solution 
Part (a): 


(i) No. The confidence interval is (0.09, 0.21), which includes the value of 0.20. Therefore, it is 


plausible that the computer program is generating discounts with a probability of 0.20, and the 
confidence interval does not provide convincing statistical evidence that the program is not 
working as intended. 


(ii) No. The confidence interval includes values from 0.09 to 0.21, so any value in that interval is a 
plausible value for the probability that the computer is using to generate discounts. 


Part (b): 


The formula for computing the margin of error for a proportion includes the square root of the sample 
size in the denominator. For a random sample that is four times the size of the original sample, the 
margin of error can be determined by dividing the margin of error of the original sample by two. 
Therefore, the new margin of error is 0.03. 


Part (c): 


Using the margin of error of 0.03 obtained from the second sample, the confidence interval for p is 
0.15 + 0.03 or (0.12,0.18). The interval does not include 0.20, and therefore, there is convincing 


evidence that the computer program is not working as intended and is not generating discounts with a 
probability of 0.20. 


Scoring 


This question is scored in four sections. Section 1 consists of part (a-i); section 2 consists of part (a-ii): 
section 3 consists of part (b); and section 4 consists of part (c). Sections 1, 2, 3, and 4 are scored as 
essentially correct (E), partially correct (P), or incorrect (J). 


Section 1 is scored as follows: 


Essentially correct (E) if the response states that because the interval contains 0.20, it does not provide 
convincing statistical evidence that the computer program is not working as intended. 


Partially correct (P) if the response indicates that it is necessary to check whether the value of 0.20 is in 
the computed interval, but there are errors in implementation. Examples of errors include: 


e ‘The response notes that 0.20 is within the interval but does not draw a conclusion. 


e ‘The response has an arithmetic error in the computation of the endpoints of the interval but 
provides a correct conclusion with justification that is consistent with the computed interval. 
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Question 2 (continued) 


Incorrect (I) if the response does not recognize how to use the confidence interval to check whether the 
computer is working correctly; 

OR 
if the response states that the interval shows that the proportion is equal to 0.20; 

OR 
if the response notes that 0.20 is within the interval and concludes that the program is not working as 
intended; 

OR 
if the response otherwise does not meet the criteria for E or P. 


Section 2 is scored as follows: 


Essentially correct (E) if the response concludes that there is not convincing statistical evidence that 
the computer program generates the discount with a probability of 0.20 AND justifies the conclusion 
by noting that there are values other than 0.20 in the interval. 


Partially correct (P) if the response correctly concludes that there is not convincing evidence that the 
computer program generates the discount with a probability of 0.20, but provides incomplete 
reasoning to justify the conclusion. 


Examples of incomplete reasoning include: 
e stating that 0.20 is a plausible value for the proportion of discounts without giving further 
explanation; 
e indicating that having 0.20 in the interval does not prove that 0.20 is the true proportion; and 
e providing a generic statistical argument, such as reference to the fact that the null hypothesis 
should never be accepted, or stating that not rejecting the null hypothesis is not proof that 0.20 
is the true proportion of bills discounted. 


Note: If an incorrect interval is computed in part (a-i) that does not contain 0.20, and in part (a-ii) the 
response concludes that the program is not generating discounts with a probability of 0.20 because 
Q.20 is not in the interval, section 2 is scored as P. 


Incorrect (I) if the response states that there is evidence that the computer program generates the 
discount with a probability of 0.20: 

OR 
if the response correctly concludes that there is not convincing evidence that the computer program 
generates the discount with a probability of 0.20 AND provides incorrect or no justification; 

OR 
if the response otherwise does not meet the criteria for E or P. 


Section 3 is scored as follows: 
Essentially correct (E) if the response gives the correct value of 0.03 as the new margin of error by 
using the correct formula or by providing a correct explanation that recognizes that quadrupling the 


sample size divides the margin of error from the original sample by two. 


Note: If the response relies on the margin of error formula and calculates a value that would round to 
0.030, the response is scored as E. 
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Question 2 (continued) 


Partially correct (P) if the response uses the correct margin of error formula or provides a correct 
explanation that recognizes that quadrupling the sample size divides the margin of error from the 
original sample by two, but calculates an incorrect margin of error that is less than 0.06: 

OR 
if the response does not calculate a margin of error but provides a correct explanation that recognizes 
that quadrupling the sample size divides the margin of error from the original sample by two; 

OR 
if the response gives the correct margin of error without correct justification. 


Incorrect (I) if the response does not recognize in any way that the new margin of error depends on the 
square root of the sample size: 
OR 
if the response calculates a margin of error that is greater than or equal to 0.06; 
OR 
if the response otherwise does not meet the criteria for E or P. 


Section 4 is scored as follows: 


Essentially correct (E) if the conclusion states that there is convincing evidence that the computer 
program is not working as intended because, based on the new margin of error, the interval does not 
contain 0.20; 

OR 
if the conclusion states the intended value of 0.20 is greater than the upper boundary of 0.18; 

OR 
if the conclusion states the margin of error is smaller than the difference between the sample 
proportion and the intended long-run proportion of 0.20. 


Notes: 
e Ifthe margin of error was computed incorrectly in part (b), but a correct answer to part (c) is 
consistent with this incorrect margin of error, section 4 is scored as E. 
e Ifnospecific margin of error or confidence interval was given in parts (b) or (c), but the 
response provides a complete and correct conclusion with reference to a smaller margin of error 
(or narrower confidence interval), section 4 is scored as E. 


Partially correct (P) if an interval is incorrectly constructed using the margin of error from part (b), but a 
correct conclusion is given and justified for the computed interval: 

OR 
if an interval is computed correctly using the margin of error from part (b) and an argument is made 
based on whether or not 0.20 is in the interval, but the conclusion is incorrect: 

OR 
if no specific margin of error or confidence interval has been given in parts (b) or (c), but the response 
concludes that because the margin of error has decreased (or the confidence interval is narrower), 0.20 
is not in the interval. 


Incorrect (I) if an interval is computed correctly using the margin of error from part (b) and a correct 
conclusion is given, but no argument is made based on whether or not 0.20 is in the interval; 

OR 
if the response otherwise does not meet the criteria for E or P. 
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Question 2 (continued) 


Notes: 
e Ifaresponse includes a confidence level, the level can be ignored because no confidence level 
was provided. 
e Ifthe response bases a conclusion on the relative location of 0.20 within the interval 
(for example, 0.20 is near the edge of the interval), the response is scored as I. 
e A response that provides additional incorrect explanation lowers the score in section 4 by one 
level (that is, from E to P, or from P to J). 


Fach essentially correct (E) section counts as 1 point. Each partially correct (P) section counts as % point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether to 
score up or down, depending on the overall strength of the response and communication. 
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Question 3 

Intent of Question 
The primary goals of this question were to assess a student’s ability to (1) perform a probability calculation 
from a discrete random variable; (2) calculate the expected value of a discrete random variable; (3) perform 
a conditional probability calculation from a discrete random variable; and (4) use probabilistic thinking to 
make a prediction about how an expected value will change given a condition about the random variable. 
solution 
Part (a): 

The probability that at least one A'l’M is working when the mall opens is: 

PX > 1) = 0.21 + 0.40 + 0.24 = 0.85. 

Part (b): 

The expected value of the number of AT’Ms that are working when the mall opens is: 

EX) = 0(0.15) + 1(0.21) + 2(0.40) + 3(0.24) = 1.73 machines. 

Part (c): 

The probability that all three A’l’Ms are working when the mall opens, given that at least one A'l’M is 


working is: 


PX=S3andx 21) RxX=3) 0.2 
= > — ee ee ~) 
P(X =3|X2>1) Ea POSS) 7 Dae ¥ 0282 


HS 


© 


Part (d): 
Given that at least one AT’M is working when the mall opens, the expected value of the number of 
working ATMs would be greater than the expected value calculated in part (b). By eliminating the 


possibility of OQ working AT’Ms, the probabilities for 1, 2, and 3 working AT’Ms all increase 
proportionally, so the expected value must increase. 


scoring 
Parts (a), (b), (c), and (d) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 
Essentially correct (E) if the probability is computed correctly with work shown. 
Partially correct (P) if the correct answer is given, but no work is shown; 
OR 


if appropriate work is shown but the answer is incorrect or missing; 
OR 
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Question 3 (continued) 


if one of the incorrect cumulative probabilities P(X <1), P(X <1), or P(X > 1)is stated AND 
computed correctly. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: The probability can be calculated as 1—- P(X = 0) =1-—0.15 = 0.85. 


Part (b) is scored as follows: 
Essentially correct (E) if the expected value is computed correctly with work shown. 


Partially correct (P) if the correct answer is given, but no work is shown; 
OR 
if appropriate work is shown but the answer is incorrect or missing. 


Incorrect (I) if the response does not meet the criteria for E or P. 
Part (c) is scored as follows: 


Essentially correct (E) if the probability is computed correctly, with work shown that includes correct 
numerical values for both the numerator and denominator. 


Partially correct (P) if the response includes a numerator and denominator in calculating the conditional 
probability, with one (numerator or denominator) correct in numerical value and the other incorrect: 

OR 
if the correct answer is given, but no work is shown. 


Incorrect (I) if the response does not meet the criteria for E or P. 
Part (d) is scored as follows: 


Essentially correct (E) if the response provides the correct answer (greater than) with a reasonable 
explanation based on the fact that with X = O eliminated, the probabilities for X =1, X = 2, and 
X = 8 all increase: 

OR 
if the response provides the correct answer (greater than) with a reasonable explanation based on the 
fact that with xX =O eliminated, the balance point of the distribution increases: 

OR 
if the response provides the correct answer (greater than) and the conditional expected value is 
computed correctly with work shown. 


Partially correct (P) if the response provides the correct answer (greater than) with a weak explanation, 
such as “Yes, because 0 is eliminated”: 
OR 


© 2015 The College Board. 
Visit the College Board on the Web: www.collegeboard.org. 


AP® STATISTICS 
2015 SCORING GUIDELINES 


Question 3 (continued) 


if the response provides the correct answer (greater than) with a reasonable but incorrect attempt to 
calculate the conditional expected value using a revised probability distribution with only the values 
X=1, X =2,and X = 3, and corresponding probabilities greater than or equal to those in the 
original probability distribution. 
Incorrect (I) if the response does not provide the correct answer (greater than): 

OR 
if the response provides the correct answer (greater than) with an incorrect explanation or no 
explanation. 


Note: ‘he conditional expected value is: 


0.21 0.40 0.24 7 | 
(O22) + 2 oe + 3222 ~ 110.247) + 2(0.471) + 3(0.282) = 2.04 machines. 


Each essentially correct (E) part counts as 1 point. Each partially correct (P) part counts as % point. 


4 Complete Response 

3 Substantial Response 
2 Developing Response 
1 Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether to 
score up or down, depending on the overall strength of the response and communication. 
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Question 4 
Intent of Question 


The primary goal of this question was to assess a student’s ability to identify, set up, perform, and interpret 
the results of an appropriate hypothesis test to address a particular question. More specific goals were to 
assess a student’s ability to (1) state appropriate hypotheses; (2) identify the appropriate statistical test 
procedure and check appropriate conditions for inference; (3) calculate the appropriate test statistic and 
p-value; and (4) draw an appropriate conclusion, with justification, in the context of the study. 


Solution 


step 1: States a correct pair of hypotheses. 


Let Dasy represent the population proportion of adults similar to those in the study who would have 


developed colon cancer within the six years of the study if they had taken a low-dose aspirin each day. 
similarly, let p,},, represent the population proportion of adults similar to those in the study who 


would have developed colon cancer within the six years of the study if they had taken a placebo each 
day. 


The hypotheses to be tested are Ho : Dasyn = Pplac VETSUS Hag : Dasp < Pplag OF equivalently, 
Hy Dasy —Daige =) Versus. Hy.2 Di. Diige <¥: 

Step 2: Identifies a correct test procedure (by name or by formula) and checks appropriate conditions. 
The appropriate procedure is a two-sample z-test for comparing proportions. 
Because this is a randomized experiment, the first condition is that the volunteers were randomly 
assigned to one treatment group or the other. The condition is satisfied because we are told that the 
volunteers were randomly assigned to take a low-dose aspirin or a placebo. 
The second condition is that the sample sizes are large, relative to the proportions involved. ‘he 
condition is satisfied because all sample counts are large enough; that is, 15 with colon cancer in 


aspirin group, 26 with colon cancer in placebo group, 500 — 15 = 485 cancer-free in aspirin group, and 
500 — 26 = 474 cancer-free in placebo group. 


step 3: Calculates the appropriate test statistic and p-value. 


15 26 


The sample proportions who developed colon cancer are Dis = 509 > 0.030 and Dee = 509 > O:052: 
. ah 15 + 26 
The combined sample proportion who developed colon cancer is Doombined = Gan eaw = 0.041. 
500 + 500 
The test statisticis z = ———— ~ —1.75 (-1.7542 from calculator). 
0.041(1 — 0.040 ( ‘ aa 
The p-value is P(Z < —1.75) = 0.0401 (0.0397 from calculator), where Z has a standard normal 


distribution. 
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Question 4 (continued) 
step 4: States a correct conclusion in the context of the study, using the result of the statistical test. 


Because the p-value is less than the given significance level of a = 0.05, we reject the null hypothesis. 
The data provide convincing statistical evidence that the proportion of all adults similar to the 
volunteers who would develop colon cancer if they had taken a low-dose aspirin every day is less than 
the proportion of all adults similar to the volunteers who would develop colon cancer if they had not 
taken a low-dose aspirin every day. 


Scoring 


Steps 1, 2, 3, and 4 are scored as essentially correct (E), partially correct (P), or incorrect (J). 
Step 1 is scored as follows: 


Essentially correct (E) if the response identifies correct parameters AND both hypotheses are labeled 
and state the correct relationship between the parameters. 


Partially correct (P) if the response identifies correct parameters OR states correct relationships, but not 
both. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Note: Hither defining the parameters in context, or simply using common parameter notation with 
subscripts clearly relevant to the context, such as p,,, and Dpjgc, 1s sufficient. 


Step 2 is scored as follows: 


Essentially correct (E) if the response correctly includes the following three components: 
1. Identifies the correct test procedure (by name or by formula). 
2. Notes that the use of random assignment satisfies the randomness condition. 
3. Checks for approximate normality of the test statistic by citing that all four counts are larger 
than some standard criterion such as 5 or 10. 


Partially correct (P) if the response correctly includes only two of the three components. 
Incorrect (I) if the response correctly includes at most one of the three components. 


Notes: 

e For the randomness component, it is (minimally) acceptable to say “random assignment — 
check” but not acceptable to say “random — check” or “SRS — check.” ‘The important concept 
here is that it is random assignment, and not random sampling, that is required. If the response 
implies that the study used a random sample, the randomness component is not satisfied, 
regardless of whether random assignment is correctly addressed. 

e ‘The normality check may use the expected counts under the null hypothesis in place of 
observed counts. 
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Question 4 (continued) 
Step 3 is scored as follows: 


Essentially correct (E) if the response correctly calculates both the test statistic and a p-value that is 
consistent with the stated alternative hypothesis. 


Partially correct (P) if the response correctly calculates the test statistic but not the p-value; 
OR 
if the response calculates the test statistic incorrectly but then calculates the correct p-value for the 
computed test statistic; 
OR 
if the response reports the correct p-value but no calculations or test statistic are shown. 


Incorrect (I) if the response fails to meet the criteria for E or P. 


Note: The p-value is considered correct if it is consistent with the alternative stated in the response 
and the calculated test statistic, even if those are incorrect. 


Step 4 is scored as follows: 


Essentially correct (E) if the response provides a correct conclusion in context, with justification based 
on linkage between the p-value and the given @ = 0.05. 


Partially correct (P) if the response provides a correct conclusion, with linkage to the p-value, but not in 
context; 

OR 
if the response provides a correct conclusion in context, but without justification based on linkage to 
the p-value. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 

e ‘The conclusion must be related to the alternative hypothesis. 

e Ifthe p-value is incorrect, then step 4 is scored as E if the response includes proper linkage and 
a conclusion in context consistent with that p-value. 

e Ifthe p-value is less than 0.05, wording that states or implies that the alternative hypothesis is 
proven lowers the score one level (that is, from E to P or P to I) in step 4. 

e Ifthe p-value is incorrect and greater than 0.05, wording that states or implies that the null 
hypothesis is accepted lowers the score one level (that is, from E to P or P to I) in step 4. 
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Question 4 (continued) 


Fach essentially correct (E) step counts as 1 point. Each partially correct (P) step counts as % point. 


4 


3 


2 


1 


Complete Response 
Substantial Response 
Developing Response 


Minimal Response 


If a response is between two scores (for example, 2% points), use a holistic approach to decide whether to 
score up or down, depending on the overall strength of the response and communication. 
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Question 5 
Intent of Question 
The primary goals of this question were to assess a student’s ability to (1) use the information provided by 
a scatterplot to describe the relationship between two quantitative variables; (2) interpret and use the 
information given by lines displayed on a scatterplot; and (3) use a regression equation to estimate a 
predicted value of y fora given x value. 
Solution 


Part (a): 


There is a moderately strong, positive, linear relationship between height and arm span so that taller 
students tend to have longer arm spans. 


Part (b): 
(i) The line in Graph 2 is the one that is helpful. For each student, the graph illustrates whether arm 


Span is equal to height (points on the line), arm span is greater than height (points above the line), 
or arm span is less than height (points below the line). 


(ii) 
Tall Rectangle short Rectangle 


Part (c): 


The predicted arm span is y = 11.74 + 0.8247x = 11.74 + 0.8247(61) = 62.05 inches. 


Scoring 
Parts (a), (bo), and (c) are scored as essentially correct (E), partially correct (P), or incorrect (I). 
Part (a) is scored as follows: 
Essentially correct (E) if the response includes the following four components: 
1. ‘The relationship is approximately linear. 
2. ‘The relationship is positive. 
3. There is a moderate to strong association (or relationship). 
4 ‘The response is given in context. 


Partially correct (P) if the response includes only two or three of the four components. 


Incorrect (I) if the response includes at most one of the four components. 
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Question 5 (continued) 


Correlation alone is not sufficient to indicate a linear relationship. However, linear correlation is 
sufficient to indicate a linear relationship. 

Reporting that the correlation coefficient is 0.81, the correlation coefficient is close to 1.0, or 
some other value or range of values for the correlation coefficient is not sufficient to satisfy 
component 3. 


Part (b) is scored as follows: 


Essentially correct (E) if the response contains the following four components: 


di 
Zi 


4. 


The line in Graph 2 is selected. 

A reasonable explanation for selecting the line in Graph 2 that links the body shapes to the 

regions defined by the line is given. The explanation should indicate that square shapes (arm 

Span is equal to height) are represented by points on the line, tall rectangle shapes (arm span 

less than height) are represented by points below the line, and short rectangle shapes (arm 

Span is greater than height) are represented by points above the line. 

Correct counting of body types demonstrated by reporting correct frequencies (3, 4, and 5) or 

reporting correct proportions oie ce ue om and a for the square, tall rectangle, and short 
4 12°3 12 12 

rectangle body types, respectively. 

The correct frequencies (3, 4, and 5) are reported in the table. 


Partially correct (P) if the response includes only two or three of the four components. 


Incorrect (I) if the response includes at most one of the four components. 


Notes: 
e 


selecting the regression line on Graph 1 cannot satisfy either component 1 or component 2. 
To satisfy component 1, it is sufficient to refer to the y = x line without explicitly mentioning 
Graph 2. 

Use of incorrect labels, such as regression line or least-squares regression line, in referring to 
the y = x line in Graph 2 is an incorrect use of terminology that should be strongly 
discouraged, but it is ignored in this scoring rubric. 

The explanation for selecting Graph 2 is acceptable if it explicitly includes at least two of the 
following: (i) square body shapes (arm span equal to height) are represented by points on the 
line, (ii) tall rectangle body shapes (arm span less than height) are represented by points below 
the line, or (iii) short rectangle body shapes (arm span greater than height) are represented by 
points above the line. 

Incorrectly counting the points on the y = x line in Graph 2 is considered a minor error. 
Frequencies (2, 4, and 5) are accepted for both component 3 and component 4. Reporting 


aaa eeree 


corresponding proportions in the table satisfies component 3 but 


ga ie tet Fe LA ae. 
does not satisfy component 4. 

Frequencies reported in the table as (2, 5, 4) or (8, 5, 4), interchanging the counts for tall and 
short rectangle body shapes, satisfies component 3, but does not satisfy component 4. 

5 4 | mF 3.5 4 


Z 








Reporting corresponding proportions, , In the table does not satisfy 











either component 3 or component 4. 
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Question 5 (continued) 
Part (c) is scored as follows: 


Essentially correct (E) if the response contains the following three components: 
1. Correct formula for predicting arm span with 61 inserted for x. 
2. Correct value for the predicted arm span. 
3. Units for the predicted arm span given as inches. 


Partially correct (P) if the response includes only two of the three components. 
Incorrect (I) if the response includes at most one of the three components. 


Notes: 


e Any value for the predicted arm span between 61.5 and 62.5 is acceptable; values outside the 
interval do not satisfy component 2. This allows for inaccuracy in obtaining a value for the 
predicted arm span directly from the line displayed on Graph 1 and for rounding in applying the 
prediction formula; but it excludes predictions based on the y = x line and other unreasonable 
predictions. 

e Reporting a prediction equation that is similar to the equation given in the stem, with 61 
inserted for x, satisfies components 1 and 2 if the value of the predicted arm span is between 
61.5 and 62.5 inches. Otherwise, neither of those two components is satisfied. This could occur 
if a student enters the data from the graph into a calculator to compute the least squares 
regression line, instead of using the formula provided in the stem. 


4 Complete Response 

All three parts essentially correct 
3 Substantial Response 

Two parts essentially correct and one part partially correct 
2 Developing Response 


Two parts essentially correct and one part incorrect 


OR 
One part essentially correct and two parts partially correct 
OR 
One part essentially correct, one part partially correct, and one part incorrect 
OR 
Three parts partially correct 
1 Minimal Response 
One part essentially correct and two parts incorrect 
OR 


Two parts partially correct and one part incorrect 
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Question 6 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) describe how sample data would 
differ using two different sampling methods; (2) describe the sampling distribution of the sample mean for 
two different sampling methods; and (3) choose the sampling method that will result in the best estimate 
of the population mean. 


Solution 
Part (a): 


No, a sample obtained using Method 2 will not be representative of all tortillas made that day. The 
sample obtained using Method 2 will only represent the tortillas from one production line, not from the 
entire population because the distributions of diameters for the two production lines are different. 


Part (b): 


Method 1 was most likely used to select this sample. The bimodal shape in the histogram of sample 
data indicates that tortillas were selected from both production lines, which is what would happen 
using Method 1. Method 2 would be likely to produce a unimodal distribution of diameters centered at 
either 5.9 inches or 6.1 inches. 


Part (c): 
Method 2 would result in less variability in the sample of 200 tortillas on a given day because the 
sample comes from only one production line. Because the distributions of diameters are not the same 
for the two production lines, selecting tortillas from both lines as in Method 1 would result in more 
variable sample data. 


Part (d): 


The sampling distribution of the sample mean diameter for samples obtained using Method 1 would be 





approximately normal with mean 6 inches and standard deviation ou = 0.0078 inch. 


J/200 
Part (e): 


Method 1 would result in less variability in the sample means over the 365 days, because with Method 
2, roughly half of the sample means will be around 5.9 inches and the other half will be around 6.1 
inches. With Method 1, however, the sample means will all be very close to 6 inches, as indicated by 
the standard deviation in part (d). 


Part (f): 


Method 1 is more likely to produce a sample mean close to 6 inches. Because the sample mean is an 
unbiased estimator for both methods, the manager should pick the method that would result in less 
variability in the distribution of the sample mean. Based on the answer to part (e), Method 1 results in 
less variability in the distribution of the sample mean. 
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Question 6 (continued) 


Scoring 


This question is scored in three sections. Section 1 consists of parts (a), (b), and (c), section 2 
consists of part (d), and section 3 consists of parts (e) and (f). Sections 1, 2, and 3 are scored as 
essentially correct (E), partially correct (P), or incorrect (I). 


Section 1 is scored as follows: 


Essentially correct (E) if the response includes the following three components: 

1. In part (a) the response says no AND either argues that the sample will only be selected from 
one production line and not the entire population (that is, the sample will only represent one 
production line) OR argues that the tortillas from the two productions lines are different. 

2. In part (b) the response chooses Method 1 AND refers to a relevant characteristic of the 
histogram (shape, center, or variability) that matches what would be expected when using 
Method 1 (or that does not match what would be expected when using Method 2). 

3. In part (c) the response chooses Method 2 AND either justifies by stating that the sample 
comes from only one production line (does not come from both production lines) OR justifies by 
comparing the possible range of diameters for the two methods. 


Partially correct (P) if the response includes only two of the three components. 
Incorrect (I) if the response includes at most one of the three components. 


Note: If a response includes more than one justification in an individual part, score the weaker of the 
two justifications. For example, a response for part (a) that says “No, because only one line was 
selected and because the sample size was too small” does not satisfy the first component because the 
sample size argument is incorrect. 


Section 2 is scored as follows: 


Essentially correct (E) if the response includes the following three components: 
1. The shape is approximately normal. 
2. ‘The mean is 6 inches. 
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3. The standard deviation is inch. 


Partially correct (P) if the response includes only two of the three components. 
Incorrect (I) if the response includes at most one of the three components. 
Note: It is not necessary to include units (inches) for the mean or standard deviation. 
Section 3 is scored as follows: 
Essentially correct (E) if the response includes the following two components: 
1. In part (e) the response chooses Method 1 AND describes the sampling distribution of the 
sample mean for Method 2 as having some sample means close to the mean of production 


line A (5.9 inches) and the other sample means close to the mean of production line B 
(6.1 inches). 
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Question 6 (continued) 


2. In part (f) the response chooses Method 1 AND 
=" refers to a correct answer in part (e): 
OR 
=" describes the sampling distribution of the sample mean for Method 2 as having some 
sample means close to the mean of production line A (5.9, less than 6) and the other 
sample means close to the mean of production line B (6.1, greater than 6); 
OR 
=" argues that on a single day, it would be preferable to get a sample with a mean around 
6 rather than getting a sample that would have a mean around 5.9 (less than 6) or a 
mean around 6.1 (greater than 6). 


Partially correct (P) if the response includes one of the two components. 
Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 
e In part (e), the response must be clear that there is more than one mean being described 
(365 sample means). 
o Correct: The sample means will be around 56.9 or 6.1. 
o Notcorrect: The sample mean will be around 5.9 or 6.1. 


e Parts (e) and (f) are not satisfied if the response does not imply the sample means vary from the 
population means in both methods (or does not imply that the sample mean varies from the 
population mean when making the single day argument in part (f)). However, if the mistake is 
made in part (e), do not penalize the response in part (f) for the same mistake. 

o Correct: The sample means will be around 5.9 or 6.1 instead of close to 6. 
o Notcorrect: The sample means will be 5.9 or 6.1 instead of close to 6. 
o Notcorrect: The sample means will be 6 instead of around 5.9 or 6.1. 
4 Complete Response 
All three sections essentially correct 
3 Substantial Response 
T'wo sections essentially correct and one section partially correct 


2 Developing Response 


T'wo sections essentially correct and one section incorrect 


OR 
One section essentially correct and either one or two sections partially correct 
OR 
All three sections partially correct 
1 Minimal Response 
One section essentially correct and two sections incorrect 
OR 


T'wo sections partially correct and one section incorrect 
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